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Abstract. In this paper we consider an optimal control problem governed by 
a time-dependent variational inequality arising in quasistatic plasticity with 
linear kinematic hardening. We address certain continuity properties of the 
forward operator, which imply the existence of an optimal control. Moreover, 
a discretization in time is derived and we show that every local minimizer 
of the continuous problem can be approximated by minimizers of modified, 
time-discrete problems. 



1 Introduction 

The optimization of elastoplastic systems is of significant importance for indus- 
trial deformation processes, e.g., for the control of the springback of deep-drawn 
metal sheets. In this paper we consider an optimal control problem for the qua- 
sistatic problem of small-strain elastoplasticity with linear kinematic hardening. 
The strong formulation of the forward system (in the stress based, so-called dual 
formulation) reads (cf. [Han and Reddy, 1999, Chapters 2, 3]) 

C _1 <7 - e(u) + A (cr D + x D ) = in (0, T) x CI, 
H- 1 * +\(<T D + X D ) = m(0,T)xfl, 



diver = -/ in (0,T) x ft, 

with complem. conditions < A _L 0(E) < in (0,T) x fi, 

and boundary conditions u = on (0, T) x Trj, 

cr ■ n = g on (0, T) x Tfj. 

The system (1.1) is subject to the initial condition (cr(0), %(0), u(0)) = 0. The state 
variables consist of the stress cr and back stress x, whose values are symmetric 
matrices. Both matrix functions are combined into the generalized stress E = 
(cr, x). The state variables also comprise the vector- valued displacement u and 
the scalar-valued plastic multiplier A associated with the yield condition 0(E) < 
of von Mises type, see (1.5). The first two equations in (1.1), together with the 
complementarity conditions, represent the material law of quasistatic plasticity. 
The tensors C _1 and H _1 are the inverses of the elasticity tensor and the hardening 
modulus, respectively, and cr D denotes the deviatoric part of cr, see (1.6). As usual, 
e(v) — (Vt> + (Vt)) T ) /2 denotes the (linearized) strain, where V« is the gradient of 
v. The third equation in (1.1) is the equilibrium of forces. By diver we denote the 
(row- wise) divergence of the matrix- valued function er. The boundary conditions 
correspond to clamping on T o C dil and the prescription of boundary loads g on 
the remainder IV = dfl \Td, whose outer unit normal vector is denoted by n. We 
will see in Section 1.2 that (1.1) can be reformulated as a mixed, time-dependent 
and rate-independent variational inequality of the first kind. 
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The boundary loads g act as control variables. There would be no difficulty in 
admitting volume forces / as additional control variables but for practical reasons 
and simplicity of the presentation, / is assumed to be zero throughout. The optimal 
control problem under consideration reads 



Here, ip is a functional favoring certain displacements u. A typical example would 
be to track the final deformation after unloading. The set t/ a d realizes constraints 
on the control g. The assumptions on ip and U&d are given in Assumption 2.8, 
followed by a number of examples. 

The aim of this paper and of the subsequent works Wachsmuth [2012a,b] is the 
derivation of first order necessary optimality conditions for (P). Let us highlight 
the main contributions of this paper. 

(1) We show that the derivative (E, ii) of the solution depends continuously on 
the derivative g of the right-hand side, see Corollary 2.4. This continuity 
result of the solution operator of (the weak formulation of) (1.1) is novel 
compared to the stability results known from the literature. 

(2) Since the solution map of the system (1.1) is nonlinear, its weak continuity is 
not obvious. We prove this weak continuity in Theorem 2.7. An immediate 
consequence is the existence of optimal controls of (P), see Theorem 2.9. 

(3) We show the convergence of the solutions of a time-discrete variant of (1.1) 
w.r.t. the time-step size. In particular, we prove a rate of convergence of the 
generalized stresses X in L°°(0,T) without assuming additional regularity, 
see Lemma 3.2. Moreover, the convergence of the plastic multiplier A is 
shown in Theorem 3.4. These findings improve the results known from the 
literature. 

(4) We study the approximation of local minima of (P) by local minima of 
its time-discrete variant, see Section 3.4. The results of this section are 
interesting in their own right and they are also used in order to derive 
necessary optimality conditions of (P), see [Wachsmuth, 2012b, Section 3]. 
Moreover, these approximation properties are important for a numerical 
realization of the optimal control problem. 

There are only few references concerning the optimal control of rate- independent 
systems, see Rindler [2008, 2009], Kocvara and Outrata [2005], Kocvara et al. [2006], 
Brokate [1987]. In the case of an infinite dimensional state space (i.e. W 1 ' P {Q 1 T; X), 
dimX = oo), there are no contributions providing optimality conditions for rate- 
independent optimal control problems. Rindler [2008, 2009] study the optimal 
control of rate-independent evolution processes in a general setting. The existence 
of an optimal control and the approximability by solutions of discretized prob- 
lems is shown. The system of quasistatic plasticity in its primal formulation (see 
[Han and Reddy, 1999, Section 7]) is contained as a special case. However, in con- 
trast to our analysis, the boundedness of the control in W 1,o °(0, T; U) (cf. [Rindler, 
2008, Assumption (U)[) is required and no optimality conditions are proven. 

A finite dimensional situation (i.e. with state space M /1,p (0, T; R™)) is considered 
in Kocvara and Outrata [2005] , Kocvara et al. [2006] , who consider spatially dis- 
cretized problems, and Brokate [1987], who deals with optimal control of an ODE 
involving a rate-independent part. 




such that (X, u, A) solves the quasistatic plasticity problem (1.1), 
and g e U ad . 



(P) 
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The static (i.e. time-independent) version of the optimal control problem (P) 
was considered in Herzog et al. [to appear, 2011c]. For locally optimal controls, 
optimality systems of B- and C-stationarity type were obtained. 

Let us sketch the outline of this paper. In the remainder of this section, we 
introduce the notation and fix the functional analytic framework. Moreover, we 
state the weak formulation of (1.1) and give some references concerning its analysis. 
Section 2 is devoted to the continuous optimal control problem (P). We give a 
brief introduction to evolution variational inequalities (EVIs). Restating (1.1) as 
an EVI, we are able to prove some new continuity properties of its solution operator. 
Moreover, we show the weak continuity of the control-to-state map of (P). 

In Section 3 we consider a discretization in time of the control problem (P). 
We show two convergence properties of the solution of the time-discrete forward 
problems, similar to the strong and weak continuity of the solution map of (1.1). 
Finally, these results are used in Section 3.4 to prove that every local minimizer g 
of the control problem (P) can be approximated by local minimizers of a slightly 
modified time-discrete problem (Pg), see Theorem 3.10. 

This paper can be understood as a prerequisite for the analysis contained in 
Wachsmuth [2012a,b]. Since the problems under consideration and their analysis 
are non-standard, the results presented here are believed to be of independent 
interest. 

We give a brief overview over Wachsmuth [2012a,b]. We regularize the time- 
discrete forward problem and show the Frechet differentiability of the associated 
solution map. This result requires some subtle arguments. The regularized time- 
discrete optimal control problems are differentiable and consequently, optimality 
conditions can be derived in a straightforward way. The passage to the limit in 
the regularization parameter e leads to an optimality system of C-stationary type 
for the time-discrete problem. Finally, we pass to the limit with respect to the 
discretization in time. This part also requires new convergence arguments. Due to 
the weak mode of convergence of the adjoint variables, the sign condition for the 
multipliers is lost in the limit and we finally obtain a system of weak stationarity 
for the optimal control of (1.1). 

1.1. Notation and assumptions. Our notation follows Han and Reddy [1999] 
and Herzog et al. [to appear]. 

Function spaces. Let Q, C M. d be a bounded Lipschitz domain with boundary 
r = <90 in dimension d = 3. The boundary consists of two disjoint parts Tjv and 
Tr>. We point out that the presented analysis is not restricted to the case d = 3, 
but for reasons of physical interpretation we focus on the three dimensional case. In 
dimension d = 2, the interpretation of (1.1) has to be slightly modified, depending 
on whether one considers the plane strain or plane stress formulation. 

We denote by S :— the space of symmetric d-hy-d matrices, endowed with 

the inner product cr : r = a ij T ij> anc ^ we define 

V = H}y(n;R d ) = {ue H 1 (n;R d ) : u = on T D }, S = L 2 (Q;§) 

as the spaces for the displacement u, stress cr, and back stress Xi respectively. The 
control g belongs to the space 



U = L 2 (T N ;R d ). 
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The control operator E : U — > V , g H> £, which maps boundary forces (i.e. controls) 
g € U to functionals (i.e. right-hand sides of the weak formulation of (1.1), see 
Section 1.2) £ £ V is given by 

(v, Eg) v ,v> := — / v-gds for all w € V. (1.2) 

Hence, E — — t^, where tjv is the trace operator from V to U = L 2 (T^;'R d ). 
Clearly, E : U — > V is compact. 

Starting with Section 2.2, we will omit the indices on the duality bracket (•, •). 
We denote by (•, •) the dual pairing between V and its dual V', or the scalar 
products in S or S 2 , respectively. This will simplify the notation and cause no 
ambiguities. 

For a Banach space X and p <E [1, oo], we define the Bochner-Lebesgue space 

L p (0, T; X) — {u : [0, T] — > X, u is Bochner measurable and p-integrable}. 

In the case p = oo one has to replace p-integrability by essential boundedness. The 
norm in L p (0, T; X) is given by 

IMUp(o,t ; X) = ||ll M (-)llx|| iP(0Tr 

By W /1,p (0, T; X) we denote the Bochner-Sobolev space consisting of functions u G 
L p (0, T; X) which possess a weak derivative u G L p (0, T; X). Two equivalent norms 
on W^iO, T; X) are given by 

(II«III P (o,t ;X ) + II«IIl P (o,t;x)) 1/p and (ll«(o)l& + Ml H0 ,T; X) ) 1/P i (i-s) 

where the extension to the casep = oo is clear. We use i? 1 (0, T; X) = 1T /1,2 (0, T; X). 
Moreover, we define the space of functions in H 1 (Q,T; X) vanishing at t = 

H\ o} {0,T;X) = {u e H\0,T-X) : u(0) = 0}. (1.4) 

Details on Bochner-Lebesgue and Bochner-Sobolev spaces can be found in Yosida 
[1965], Gajewski et al. [1974], Diestel and Uhl [1977], or Ruzicka [2004]. 

Yield function and admissible stresses. We restrict our discussion to the von 
Mises yield function. In the context of linear kinematic hardening, it reads 

^) = {\a D + x D \ 2 -^)/2 (1.5) 

for S = (<t, x) £ S 2 , where |-| denotes the pointwise Frobenius norm of matrices 
and 

ct d =a - - (trace a) I (1.6) 
a 

is the deviatoric part of <r. The yield function gives rise to the set of admissible 
generalized stresses 

K = {£ e S 2 : 0(E) < a.e. in Ct}. 

Let us mention that the structure of the yield function cj> given in (1.5) implies the 
shift invariance 

Se/C E + (t, -t) € K. for all t e S. (1.7) 

This property is exploited quite often in the analysis. 

Due to the structure of the yield function <j), a D + x D appears frequently and 
we abbreviate it and its adjoint by 

2?E = <j d + x° and V*cr = j (1.8) 
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for matrices £ £ S 2 as well as for functions £ £ S 2 and £ g i^(0,T; S 2 ). When 
considered as an operator in function space, T> maps 5 2 and L p (0,T; S 2 ) contin- 
uously into S and L p (0,T; S), respectively. For later reference, we also remark 
that 

V*VS = (°^ D + ^\ and (V*V) 2 = 2V*V 

holds. Due to the definition of the operator T>, the constraint 0(£) < can be 
formulated as HPEH^oo^.g) < ctq. Hence, we obtain 

Ee/C £>£ € L°°(fi;S). (1.9) 

Here and in the sequel we denote linear operators, e.g. I? : S 2 — > 5, and 
the induced Nemytzki operators, e.g. V : £^(0, T; S 2 ) -> F X (0,T;5) and V : 
L 2 (0,T; S 2 ) — > L 2 (0,T; S), with the same symbol. This will cause no confusion, 
since the meaning will be clear from the context. 

Operators. The linear operators A : S 2 — > S 2 and B : S 2 —> V are defined as 
follows. For E = (cr, x) G <S 2 and T = (r, /x) e S* 2 , let AE be defined through 

(T, AE) S 2 = / r:C" 1 crdx+ / /iiH^xdx. (1.10) 
Jn Jn 

The term (1/2) (AS, S)g2 corresponds to the energy associated with the stress 
state E. Here C _1 (a;) and H _1 (a;) are linear maps from § to § (i.e., they are fourth 
order tensors) which may depend on the spatial variable x. For E = (tr,%) € S 2 
and v 6 V, let 

(BS,v) viy = - I a:e(v)dx. (1.11) 
We recall that e(v) = (Vv + (Vv) T ) /2 denotes the (linearized) strain tensor. 

Standing assumptions. Throughout the paper, we require 
Assumption 1.1. 

(1) The domain fl C K d , d — 3 is a bounded Lipschitz domain in the sense of 
[Grisvard, 1985, Chapter 1.2]. The boundary of f2, denoted by T, consists 
of two disjoint measurable parts Tn and To such that T = Tn U To- 
While Tn is a relatively open subset, is a relatively closed subset of 
r. Furthermore is assumed to have positive measure. In addition, 
the set fl U Tn is regular in the sense of Groger, cf. Groger [1989]. A 
characterization of regular domains for the case d £ {2, 3} can be found in 
[Haller-Dintelmann et al., 2009, Section 5], This class of domains covers a 
wide range of geometries. 

(2) The yield stress do is assumed to be a positive constant. It equals \/2/3 ao , 
where Co is the uni-axial yield stress. 

(3) Cr 1 and HT 1 are elements of L°°(fi; £(§, S)), where £(§,§) denotes the 
space of linear operators § — > S. Both C _1 (a;) and M.~ 1 (x) are assumed 
to be uniformly coercive. Moreover, we assume that C _1 and H^ 1 are 
symmetric, i.e., r : C _1 (a;) cr = cr : C _1 (x) r and a similar relation for H _1 
holds for all cr, r S S. 



Assumption 1.1 (1) is not restrictive. It enables us to apply the regularity results 
in Herzog et al. [2011a] pertaining to systems of nonlinear elasticity. The latter 
appear in the time-discrete forward problem and its regularizations. Additional 
regularity leads to a norm gap, which is needed to prove the differentiability of the 
control-to-state map. 
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Moreover, Assumption 1.1 (1) implies that Korn's inequality holds on f2, i.e., 

\H\ 2 H x ( n m < ck (||t*||i 9( r D . 1 R-) + lk(«)lll) (1-12) 

for all u G J ff 1 (fi;R d ), see e.g. [Herzog et al., 2011a, Lemma C.l]. Note that (1.12) 
entails in particular that ||e(u)||s is a norm on Hj-,(Cl; R d ) equivalent to the standard 
H 1 (il;TSL d ) norm. A further consequence is that B* satisfies the inf-sup condition 

\\u\\ v < ^ \\B*u\\ S 2 for all u G V. (1.13) 

Assumption 1.1 (3) is satisfied, e.g., for isotropic and homogeneous materials, 
for which 

C _1 cr = — — cr — - — — — — trace(cr) I 

with Lame constants fx and A, provided that ji > and dX + 2ji > hold. These 
constants appear only here and there is no risk of confusion with the plastic multi- 
plier A. A common example for the hardening modulus is given by H _1 x = \/k\ 
with hardening constant k\ > 0, see [Han and Reddy, 1999, Section 3.4]. 

Clearly, Assumption 1.1 (3) shows that (AS, S)g2 > a||S||| 2 for some a > 
and all S G S 2 . Hence, the operator A is S^-elliptic. 

1.2. Weak formulation of (1.1) and known results. Testing the strong for- 
mulation of the equilibrium of forces 

divcr(i) = in f2 and <x(i) • n = g(t) on Tjv 

with v iE V and integrating by parts, we obtain 

(E(t),B*v)s*=- I <r{t):e(v)dx = - [ g(t) ■ vds = (£(t), v) v ., v 
Jn Jr N 

for all v G V, or equivalently BS(t) = t(t) in V , with I = Eg, see (1.2). 

In order to derive the weak formulation of the first two equations of (1-1), we fix 
an arbitrary test function T = (t, /x) G /C. Testing the first and second equation of 
(1.1) with r — <r(i) and fi — x(£), respectively, we obtain 

(A±(t) + B*ii(t), T - S(t)) S 2 + [ X(t) VE(t) : (VT - £>£(*)) da; = 0. 

Jn 

Due to the complementarity relation in (1.1) and E(t),TgKwe find 

/ \(t)VE(t):(DT-VE(t))dx< / X(t) (\VE(t)\ \VT\ - a 2 ) dx < 0. 
Jn Jn 

Hence, we have 

(AE(t) + B*ii(t), T - S(t)) S 2 > for all T G K. 

We have derived the weak formulation of (1.1) in the stress-based (so-called dual) 
form. It is represented by a time-dependent, rate-independent variational inequality 
(VI) of mixed type: find generalized stresses X G ff 1 (0,T;S' 2 ) and displacements 
u G H r {Q,T\ V) which satisfy E(t) G K, and 

(AE(t) + B*u(t), T - S(i)) S 2 > for all T G K, 

BT,(t)=l(t) mV, (Vl1 

f.a.a. t G (0,T). Moreover, (VI) is subject to the initial condition (S(0),it(0)) = 
(0, 0). In order to guarantee the existence of a solution, we have to require £(0) = 0. 
Note that a weak formulation involving the plastic multiplier A is given in (1.22). 

The remainder of this section is devoted to known results on the analysis of (VI) . 
We start with the results given in [Han and Reddy, 1999, Section 8]. We point out 
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that the authors handle a general situation which includes the case of kinematic 
hardening as a special case. 

Reformulation as a sweeping process in S. For £ £ V 1 , we denote by 

K t = {£ e K : BY, = £} (1.14) 

the subset of K, on which the constraint BY = £ is fulfilled. Testing the first equation 
of (VI) with T £ K m results in: given £ £ j^ o} (0,T; V), find £ G i?| Q} (0, T; 7'), 
satisfying £(i) G /Q( t ) and 

(AY(t), T- S(t)) S a > for aU T G /Q (t) and almost alH G (0,T). (1.15) 

This is called the stress problem of plasticity. It is a time-dependent VI, where the 
associated convex set IC^m changes in time. Such an equation was introduced in 
Moreau [1977]. We mention that there are existence and uniqueness results, but no 
continuity results for the abstract situation considered in Moreau [1977] seem to be 
available. 

Stress problem. [Han and Reddy, 1999, Section 8] deals with the so-called dual 
formulation (1.15). In [Han and Reddy, 1999, Theorem 8.9] the existence and 
uniqueness of a solution £ of (1.15) is shown together with the a-priori bound 

II^II^CO.TjS 2 ) < C\\£\\ H i( . T . v ,y (1-16) 

Let us denote the solution map by £/ s , i.e. £ = C7 S (£). Additionally, [Han and Reddy, 
1999, Theorem 8.10] shows the local Holder continuity of index 1 /2 of £ s : Hh } (0, T; V) -> 
L°°(0,T;S 2 ), i.e., 

||S E (*i)-G E (*2)||'„ (0T . s2) <C(||ii||^ ( o,r ! vo + ll^||^(o,r i vo)ll^-^|U»(o I T i vo- 

(1.17) 

We mention that this is not a typical estimate. As long as the derivatives of £j 
remain bounded in L 2 (0,T;V'), one can control the L 00 -norm of Si — £2 solely 
through the L 2 -norm of the difference £\ ~ £2- We give a generalization of this 
estimate in Theorem 2.3 in the context of EVIs. 

Finally, [Han and Reddy, 1999, Theorem 8.12] shows that one can introduce a 
(not necessarily unique) multiplier associated with the equilibrium of forces _BS = £, 
which can be interpreted as the displacement u. As a consequence, (S,u) satisfies 
(VI). Thus, given £ G i^ 0} (0, T; V), there exists (£, u) G H\ 0} (0, T; S 2 x V) such 
that £(£) G /C and 

(A£(i) + B*ii(t), T S(i)) S 2 > forallTe/C, (1.18a) 
BY,(t)=£(t) inV (1.18b) 

holds for almost all £ G (0,T). This shows the equivalence of (1.15) and (VI). 
Note that (1.18) is equivalent to (VI). In the sequel we use either reference as 
appropriate. 

Kinematic hardening. In the case of kinematic hardening, the uniqueness of u is 
obtained easily. Let us test (1.18a) with T = £(t) + (r, — t) = (<r(t) +r, %(t) — r). 
Due to the shift invariance (1.7), we have T € /C for all t £ S. This yields 

(A£(i) + B*ii(t), (t, -t)) S 2 = for all r G 5. 

Using the definitions of A and _B, see (1.10) and (1.11), respectively, we obtain 

C _1 <T — e(u) — H _1 x = almost everywhere in (0, T) x Q. 
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Integrating from to t and using the initial condition (S(0), u(0)) = yields 

C _1 ct-£(m) -H _1 x = almost everywhere in (0,T) x Q. (1.19) 

Together with the inf-sup condition of B* = (—£,0), see (1.13), this proves the 
uniqueness of u. Using (1.16) yields the a-priori estimate 

l|E||ifi(0, T;S 2 ) + IMIh^O.TjV) < C\\£\\ H i(0,T;V')- ( L20 ) 

We denote the solution mapping I i-> u of (1.18) by Q u . Moreover, the solution 
operator of (VI) mapping t — > (£, u) is denoted by Q = (G 11 , G u )- 

Primal problem. In [Han and Reddy, 1999, Section 7] the primal formulation of 
(1.15) is considered. Both formulations are equivalent, see [Han and Reddy, 1999, 
Theorem 8.3]. For the primal formulation and under the (additional) assumption 
of kinematic (or combined kinematic-isotropic) hardening, the (global) Lipschitz 
continuity of the solution operator from VF 1 ' 1 (0, T; Z') to L°°(0, T; Z) was proven, 
see [Han and Reddy, 1999, pp. 170-171]. Here, Z is the appropriate function space 
for the analysis of the primal formulation. Due to the equivalence of the primal 
and the dual problem, this Lipschitz estimate carries over to the dual problem. We 
obtain 

\\G(li) - ^2)||l~ ( o,T;S 2 xV) < CPi - ti\\w^,T;V>y (1-21) 

This Lipschitz estimate was actually already contained in rather unknown works 
of Groger, see Groger [1978a,b, 1979]. In [Groger, 1979, Section 4] the system 
(VI) is reformulated as an evolution equation associated with a maximal monotone 
operator. Then the Lipschitz estimate follows by the classical result [Brezis, 1973, 
Lemma 3.1]. 

Remark 1.2. In order to derive optimality conditions, the continuity results 
mentioned above are not sufficient. In addition, we need the continuity of 
G : HLy(Q,T;V') H} y(Q, T; S 2 x V) on two occasions. First, this continuity 
is needed to prove the approximability of local solutions by time-discrete minimiz- 
ers in Lemma 3.8. Second, the strong convergence of (S,ti) in L 2 (0,T; S 2 x V) 
is needed to pass to the limit in the optimality system, see [Wachsmuth, 2012b, 
Section 3[. 



The required continuity of Q : ilL, (0, T; V) -> HL } (0, T; S 2 x V) is shown in 
Corollary 2.4 by building on some results of Krejci [1996, 1998] about evolution 
variational inequalities (EVIs). To our knowledge, this is a new result for the 
analysis of (VI). 

Existence, uniqueness and regularity of the plastic multiplier. We have 
already seen that the generalized plastic strain P = —AH — B*u is unique. Thus by 
Herzog et al. [2011b] we obtain the existence and uniqueness of the plastic multiplier 
A € L 2 (0,T; L 2 (fl)) which can be understood as a multiplier associated with the 



constraint £ € JC or rather </>(E) < 0. We obtain the system 

At + B*u + XV*VE = in i 2 (0,T;S* 2 ), (1.22a) 

BY, = l in L 2 (0,T;V'), (1.22b) 

0<A _L 0(E) < a.e. in (0, T) x Q. (1.22c) 



Endowed with the initial condition (E(0),u(0)) = (0,0), this system is equivalent 
to (1.15) and (1.18). 
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2 Analysis of the continuous optimal control problem 

In this section we study the continuous (i.e. non-discretized) optimal control 
problem. In the first subsection, we give a brief introduction to evolution variational 
inequalities (EVIs) and state the continuity of the solution map of an EVI from 
if 1 (0,T; X) into itself, see Theorem 2.2. This enables us to show the continuity of 
the solution map of (VI), see Section 2.2. Section 2.3 is devoted to show the weak 
continuity of the control-to-state map of (P). Due to this weak continuity, we are 
able to conclude the existence of an optimal control in Section 2.4, see Theorem 2.9. 

2.1. Introduction to evolution variational inequalities. In this section we 
give a short introduction to evolution variational inequalities. A comprehensive 
presentation of this topic can be found in Krejci [1998]. We also mention the 
contributions Krejci and Lovicar [1990], Krejci [1996], Brokate and Krejci [1998]. 
For convenience of the reader who wishes to consult Krejci [1998] in parallel, we 
use the notation of Krejci [1998] in this subsection and turn back to our notation 
in the next subsection. 

Let X be some Hilbert space and Z C X be some convex, closed set. Given a 
function u : [0, T] — > X and an initial value xo € Z, find x : [0,T]->ZcI such 
that x(0) = xq and 

(ii(t) - x{t), x(t) - x) x > for all x E Z. (EVI) 

There are many applications that lead to an EVI, see [Krejci, 1998, page 2] for 
references and further comments on this topic. We only mention the strain or 
stress driven problem of plasticity with linear kinematic hardening. In the strain 
(stress) driven problem, the strain (stress) is viewed as a known quantity, whereas 
the stress (strain) has to be determined. We show that the dual formulation (1.18) 
is an appropriate starting point for deriving the strain driven problem. Krejci 
derived the stress and the strain driven problems in [Krejci, 1998, (1-29) and (1.31)], 
respectively. 

In the strain driven problem, the strain e(it) (or, equivalently, the strain rate 
(s:(u), 0) = —B*u) is viewed as a given quantity. The variational inequality (1.18a) 
reduces to find a function S : [0, T] — > JC C S 2 , such that 

^ Ce( J t(<)) ^ , T-H(t)j >0 for all Te/C, (2.1) 

where (•, -)a is the scalar product on S 2 induced by A, i.e., 

(£, T)a := (AH, T) S 2 for£,TeS 2 . (2.2) 

Using the coercivity of A, which is ensured by Assumption 1.1 (3), we find that 
(•, ■) a is scalar product on S 2 which is equivalent to the standard scalar product. 
The equation (2.1) is of the form (EVI) in the Hilbert space X = S 2 equipped 
with the scalar product (•, -)a and the feasible set Z = K,. 

Remark 2.1. We remark that there is a fundamental difference between the dual 
formulation (1.18) and the strain driven problem (2.1). Whereas in (1.18) both X 
and u are unknown quantities, e(u) has to be a-priori known in (2.1). Moreover, 
the solution £ of (2.1) has to fulfill (1.18b). 

Let us denote the solution map of (2.1) which maps u — >• X by S. Therefore, 
(1.18b) yields £ = BE = BS{u). Thus, (1.18) is equivalent to 

Find u such that BS(u) = £. 
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This technique is used in [Krejci, 1996, Chapter III] for hyperbolic equations arising 
in plasticity, i.e., the author considers plasticity problems where the acceleration 
term pit is included in the balance of forces (1.18b). 



The following theorem summarizes results on (EVI) given in Krejci [1996, 1998]. 



Theorem 2.2. Let u € 1U M (0,T;X) and x G Z be given. Then there ex- 
ists a unique solution S(xq,u) := x € W i,1 (Q,T;X) of (EVI). Moreover, 
S : Z x W 1A (0,T;X) -s- L°°(0,T;X) is globally Lipschitz continuous and S : 
Z x W l p {0,T;X) W^iO^T-X) is continuous for all p < oo. 



The operator S is called the stop operator, whereas P(xq,u) = u— S(xo,u) is 
called the play operator. 

Let us show a result which generalizes the local Holder estimate (1.17). In the 
context of EVIs this is a new result. It shows that the play operator P possesses 
an additional smoothing property which is not shared by the stop operator S. 
Another such property is that P maps C(0,T; X) (continuous functions mapping 
[0, T] — > X) to CBV(0,T; X) (continuous functions of bounded variation mapping 
[0, T] -t X), see [Krejci, 1998, Theorem 3.11]. Again, this does not hold for the 
stop operator S. 



Theorem 2.3. Let p £ [l,oo] be given and denote by q its dual exponent. Let 
ui,u 2 & W 1,q {Q,T;X) and initial values x\,x® G Z be given. Let 6 = P(x°,Ui), 
i = 1,2. Then the estimate 

||6 _ 6IIl°=(0,T;X) - 2 (ll""l|U9(0,T;Jf) + ll""2 1] i,9(0, T;X)) \\ u l ~ u 2 \\ Lp(0,T:X) 

+ 116(0) -6(0)||| 

holds, where 6(0) = Ui(0) - x% i = 1, 2. 



Proof. First we recall [Krejci, 1998, (3.16ii)], i.e., (6(i), ii(i))x = a.e. in (0,T), 
where x\ = S{x\,u\). Using x\ — U\ —6: this implies ||6(*)l|x < ||ui(i)||x a.e. m 
(0,T). Analogously, we obtain ||£ 2 (*)IU < ll«2(*)||x a.e. in (0,T). 

Using x = u-zif) — 6(0 — x 2(t) G Z as a test function in (EVI) shows 

(6(t), ui(t)-U2(t)+6(t)-€iW)x >° a.e. in (°> T )- 
Similarly, we obtain 

(&(*), « 2 W-wiW+6W-6Wk >0 a.e. in (0,T). 
Adding these inequalities, we have 

(6W-6W, 6W-6W)x < (6W-6W, «i(*)-«a(*))x a.e. in (0,T). 
This shows 

5 ^ 116 - 6llx = (6 (*) - 6(t), 6(*) - &(*))* 

< (&(*)- 6(t), «l(«)-«2(<))x 
<(||6(t)||x + ||6(*)||x)||«l(t)-«2(t)|| X 

< (||Mi(t)||x + ||«a(*)||x) ||«i(*) ~ «a(*)||x a.e. in (0,T). 
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Integrating from to t yields 

~ ll£i(*)-6(*)|||-~||£i(o)- 6(o)||| 

< / (\\ui{s)\\x + \\u2(s)\\x)\\ui{s) - u 2 (s)\\x ds 
Jo 

< (ll^l||L9(0,T:X) + ||W2|U«(0,T;X)) ll u l _ u 1 1 1 L*>(0,T;X) ■ 

Taking the supremum t £ (0, T) yields the claim. 



2.2. Quasistatic plasticity as an EVI. In this section we give an equivalent 
reformulation of (1-18) which fits into the framework of Krejci [1998]. Once this 
reformulation is established, the continuity of Q : H^(0, T; V) — > if j j(0, T; S 2 x 
V) is a consequence of Theorem 2.2. As mentioned in Remark 1.2, the continuity 
of Q is used in several places. 

In order to reformulate (1.18), we need some preparatory work. Due to the 
inf-sup condition (1.13) we obtain that for all £ £ V' there is a at £ S, such that 

(<Ti,0) G (keiB) 1 - and B{a t ,Qi) = t, (2.3) 

see Brezzi [1974]. Moreover, this mapping £ h-> <7g is linear and continuous. For 
t £ H} 0} (0,T;V), we define a t {t) := cr e(t) . Hence, cr ( . ) maps iaTL, (0, T; V) -» 
i/| O j(0,T; S) continuously. For convenience, we also introduce the notation 

S, = {a t , -at). (2.4) 

Let us mention that this is a useful tool to construct test functions T for (1.15) 
and (1.18). Indeed, for arbitrary S £ JC and £ £ V', let us define the test function 
T = S + S^-bs- The shift invariance (1.7) implies T £ JC and BT — £ is ensured 
by the definition of S^. Hence we obtain T £ JCg, i.e., it is an admissible test 
function for (1.15) and (1.18). 

Let £ £ i?| O j(0, T; V') be arbitrary. Using (2.4) we are able to reformulate 
(1.15) as an EVI. As stated in (1.7), S £ JC holds if and only if S + (t, -t) £ JC 
holds for all r £ S. Since Y,g — {ag^—crg) by definition, S £ JC if and only if 
S — S^ e /C. This gives rise to the decomposition S = So + S^. There are two 
immediate consequences: S £ JC is equivalent to So £ JC and _BS = £ is equivalent 
to S £ kerB. Therefore, S £ JC n kerB. 

We define /C B := /C n kcrS. Let T e JC B be arbitrary. Testing (1.18a) or (1.15) 
with T + S^ (t) £ JC yields an equivalent reformulation: find So £ JCb such that 

(A(S + S^), T-S ) >0 forallTe/C B . (2.5) 

Hence, (2.5) is an EVI in the Hilbert space S 2 equipped with (•, -)a- Theorem 2.2 
yields the continuity of the mapping S f H> S from Hh-. (0, T; S 2 ) to Hh-, (0, T; S 2 ). 
Since £ i— ^ S^ is continuous from H} -> (0, T; V' ) i?r i(Q, T; S 2 ), we obtain the 
continuity of the mapping £ i-> S from ff| 0} (0, T; V') to i? { 1 }( ' T 5 S ' 2 )- 

The uniqueness and continuous dependence of u on £ follows from (1.19) and 
the inf-sup condition of B*, see (1.13). Moreover, Theorem 2.2 also shows the 
Lipschitz continuity of Q : £ i-> (S,m), W 1 ' 1 ^, T; V) -> L°°(0,T;S 2 x 7), see 
(1.21). Summarizing, we have found 

Corollary 2.4. The solution operator Q of (VI) is continuous from ffr } (^' ^> 
to ifL,(0,T; S 2 x V) and globally Lipschitz continuous to L°°(0,T; S 2 x V). 
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Remark 2.5. The technique presented in this subsection is not restricted to kine- 
matic hardening only, the arguments remain valid for all hardening models involving 
some kinematic part, see also Groger [1979]. To be precise, the technique is applica- 
ble whenever the hardening variable can be split into a kinematic part x an d some 
other part rj, such that the yield function cf> can be written as <f>(S) = 4>{<r + x, v)i 
where £ = (<r,x,J7). 

It is interesting to note that this is exactly the case for which Han and Reddy 
[1999] were able to prove Lipschitz continuity of the forward operator (in primal 
formulation) from W 1 ' 1 ^^; Z') to L°°(0,T; Z), see the discussion in Section 1.2. 



2.3. Weak continuity of the forward operator. In this section we show the 
weak continuity of the control-to-state map Q o E of the optimal control problem 
(P). Here, Q is the solution map of (1.18) and E is the control operator, see (1.2). 
Since the solution operator Q o E is nonlinear, the proof of its weak continuity is 
non-trivial. The weak continuity of Q o E from HL } (0, T; U) to Hh } (0, T; S 2 x V) 
is essential for proving the existence of an optimal control in Theorem 2.9. 

A main ingredient to prove the weak continuity is the compactness of the con- 
trol operator E from U = L 2 (T N ;R d ) to V = (^(0;M d ))', see the proof of 
Lemma 2.6. Due to this compactness, Aubin's lemma, see, e.g., [Simon, 1986, 
Equation (6.5)], implies that i/ 1 (0,T; U) embeds compactly into L 2 (0,T; V). 

Let us mention that the weak continuity of Q o E is a non-trivial result: although 
E : U ->• V is compact, the operator E : ff x (0,T; U) -> ^((XT; V) (applied in 
a pointwise sense) is not compact. Hence the weak continuity of Q o E does not 
simply follow from the compactness of E nor the continuity of Q. 

First, we need a preliminary result. It can be interpreted as an upper semicon- 
tinuity result for the left-hand side of (1.18a). 



Lemma 2.6. Let {(£ fe ,w fc )} C (0, T; S 2 x V) and (S,tt) e H l {0,T;S 2 x V) 
be given. Moreover, we assume that there are g k , g G i? 1 (0, T; U) such that BH^ = 
Eg k and BY, = Eg hold. If 

(S fc ,u fc ) - (E,u) inff 1 (0,T;S 2 x V) and 
g k ^g mH\0,T;U), 
then for all T G L 2 (0, T; 5 12 ) we have 

limsup / {A± k + B*u k ,T -Y, k )&t< ( (At + B*ii, T - S) dt. 

fc— >oo Jo JO 



Proof. Due to the weak convergence of (Sfc,tife) in iJ 1 (0,T; 5 2 x V) we have 

/ (A± k + B*u k ,T)dt-> [ {AS + B*u, T) dt. 
Jo Jo 

Since U embeds compactly into V, Aubin's lemma yields that H 1 {Q,T;U) 
embeds compactly into L 2 (0,T;V) and hence g k — 1 g in iJ 1 (0,T;C7) implies 
-> -Eg in L 2 (0,T; V). This shows 

/ (B*M fc , S*) dt = / (u fc) BS fc ) dt = / {ii k , Eg k ) dt 
Jo Jo Jo 

-> / (m, Eg) dt = / («, BE) dt = / {B*ii, S) dt. 
Jo Jo Jo 
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To address the remaining term, we use integration by parts and obtain 

J (AS k , S k )dt = ~((AS k (T), S fc (T)) - (AS k (0), E fe (0))). 

The functionals S h-> (AS(i), S(t)) are continuous (w.r.t. the H 1 (0, T; S^-norm) 
and convex, hence weakly lower semicontinuous. Due to 2^(0) = 0, this al- 
ready implies (AS(0), 2(0)) = 0. Now the weak lower semicontinuity of S 4 
(AS(T), S(T)) yields 

liminf(AS fe (T), S fe (T)) > (A£(T), S(T)). 

k— J-oo 

Therefore, 

liminf / (AS fe , S fe ) dt > ~(A£(T), S(T)) = / (AS, S) dt 

k->-00 J Q 2 Jq 

holds. By combining the results above, we obtain the assertion 

limsup/ (AS k + B*u k , T- S k ) dt < / (AS + B*ii, T - S) dt. 

fc— >oo JO JO 



The previous lemma enables us to prove 



Theorem 2.7. The operator Q o E is weakly continuous from f/| o j(0, T; [/) to 
m 0| (0,T;5 2 xF). 



Proof. Let {<7 fc } C if/ i(0,T;Z7) be a sequence converging weakly towards g € 
i?| o j(0,T; {/). The solution of (1.18) with right-hand side Eg, Eg k is denoted by 
(S,u), (S k ,u k ), respectively, i.e., (S k ,u k ) = G(Eg k ) and (S,u) = G(Eg). Due to 
the boundedness of E and Q, see (1.20), there exists a subsequence (for simplicity 
denoted by the same symbol) {(S k ,u k )} which converges weakly towards some 
(S,u) in iJ 1 (0,T;S' 2 x V). We shall prove (S,u) = (S,u), therefore the whole 
sequence converges weakly and this simplification of notation is justified. To this 
end, we show that (S,u) is a solution to (1.18) with right-hand side Eg. 

First we address the admissibility of S in (1.18): the set {T € i^^TjS 2 ) : 
T(t) € JC f.a.a. t £ (0,T)} is convex and closed, hence weakly closed. Since all S k 
belong to this set, S belongs to it as well. Therefore S(t) £ JC holds for almost all 

te(o,T). 

Since B is linear and bounded, it is weakly continuous and hence BS k — v BS 
in F x (0,T;F'). Due to BS k = Eg k ->■ Eg in H l (0,T;V), we have BS = Eg in 
H x (0,T;V), this shows (1.18b). 

Let T G L 2 (0,T; S" 2 ) with T(t) G /C f.a.a. t £ (0,T) be given. This implies 

(AS k (t) + B*u k (t), T(t) - S fc (t)) > f.a.a. t G [0,T], 

see (1.18a). Integrating w.r.t. t yields 

/ (AS k + B*u k ,T-S k )dt>Q. 
Jo 

By applying Lemma 2.6, we obtain 

t . i-T 
(AS + B*ii, T - S) dt > limsup / (AS k + T - S k ) dt > 0. 

k^-oo Jo 
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Since T 6 L 2 (0,T; S 2 ) with T(t) G /C f.a.a. i 6 (0,T) was arbitrary, 

(-AE(i) + B*u(t), T — S(t)) > forallTe/C (2.6) 
holds for every common Lebesgue point t of the functions 

ti-> (AE(*)+.B*ii(t), E(*)) € ^(O.Tjm) 

and thi AS(f) + B*i(t)eL 2 (0,T;S 2 ). 

Lebesgue's differentiation theorem, see [Yosida, 1965, Theorem V.5.2], yields that 
almost all t € (0, T) are Lebesgue points of these functions. This implies that (2.6) 
holds for almost all t <G (0,T). Hence (S,m) solves (1.18). Since the solution is 
unique we obtain (X, u) = (£, u). This implies that the weak limit is indeed inde- 
pendent of the chosen subsequence. Hence, the whole sequence converges weakly, 
i.e. (E fc ,Ufc) (E,u) mH 1 (0,T;S 2 xV). That is, G(Eg k ) = (E fc ,u fc ) - (E,u) = 
Q(Eg) in iJ 1 (0,T; 5 2 x 1/). Since {</ fe } and g were arbitrary, o _E is weakly con- 
tinuous. 



2.4. Existence of optimal controls. In this section we prove the existence of an 
optimal control. The key tool in the proof of Theorem 2.9 is the weak continuity 
of the forward operator proven in Theorem 2.7. For convenience, we repeat the 
definition of the optimal control problem under consideration 



Minimize F(u,g) = tp(u) + - HsH^i^E/) 
such that (E,u)=Q(Eg) 
and g e E/ ad . 



(P) 



The admissible set [7 a( j is a convex closed subset of (0, T; U). Here, ||-||jji(o,T;!7) 
can be any norm such that i/ 1 (0,T; U) is a Hilbert space, see (1.3). Let us fix the 
assumptions on the objective ip and on the set of admissible controls U ad . 



Assumption 2.8. 

(1) The function ip : i? 1 (0,T; V) — > K is weakly lower semicontinuous, contin- 
uous and bounded from below. 

(2) The cost parameter v is a positive, real number. 

(3) The admissible set U ad is nonempty convex and closed in f/| o j(0,T; U). 



Let us give some examples which satisfy these conditions. Since the domain of 
definition of ip is i/ 1 (0,T; V) we can track the displacements, the strains or even 
their time derivatives (or point evaluations). As examples for ip we mention 



- ||u - u d \\ 2 L2(0jT;L2(nm) , i/j 2 (u) = ~ \\u(T) - u T ,d\\ 2 L*(n-jfi<i) 



^(u) = ±\\e(u(T))-e T Ah(n;S)- 

Here, u d e L 2 {0,T; L 2 (fl;R d )), u T , d & L 2 (fl;R d ) and e T ,d € L 2 (^;§) are desired 
displacements and strains, respectively. 

Let us give some examples for the admissible set U ad - The sets 

Ul d = {ge H\0,T;U) : g(0) = 0, \\g\\ LH T N ^) < P a.e. in (0,T)} 

U! d = {ge H\0, T; U) : g(0) = g(T) = 0} 

satisfy Assumption 2.8 for every p > 0. 
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Of special interest is the combination of ip 2 and U 2 d . Since we consider a qua- 
sistatic process, the condition g(T) = implies that at time t — T the solid body 
is unloaded. Due to the plastic behaviour, the remaining (and lasting) deformation 
u{T) is typically non-zero. Due to the choice of ip 2 , the displacement is controlled 
towards the desired deformation ux.d- Thus, choosing if; 2 and U 2 d allows the con- 
trol of the springback of the solid body. This is of great interest in applications, 
e.g. deep-drawing of metal sheets. 

Let us show the existence of an optimal control. Using standard arguments, see, 
e.g. [Troltzsch, 2010, Theorem 4.15], the existence of a global minimizer of (P) is a 
straightforward consequence of the weak continuity of QoE proven in Theorem 2.7. 

Theorem 2.9. There exists a global minimizer of (P). 



Proof. In virtue of the control-to-state map Q o E, we define the reduced objective 
f(g) := F(Q u (Eg),g). Due to Assumption 2.8, / is bounded from below, hence 
inf / on U ad exists. We define j := m{{f(g),g G U ad } £ K. 

Let {<7j.} C U ad be a minimizing sequence, i.e. lim/(g fc ) = j. Since ip is bounded 
from below and v > 0, the sequence {g k } is bounded in iJ 1 (0,T; U). Hence, there 
is a weakly convergent subsequence, denoted by the same symbol. We denote the 
weak limit by g. The closeness and convexity of U ad implies the weak closeness. 
Therefore, {g k } C U ad implies g £ U ad . 

Since Q u o E is weakly continuous from H (0, T; U) to H} Q -,(0,T;V), see 
Theorem 2.7, the weak lower semicontinuity of ip and ||'||jji(o t-!7) i m P nes that 
the reduced objective 

/ = F{g» o £?(•), •) = iP{Q u o £(•)) + \ Hh(0,T;U) 
is weakly lower semicontinuous. Therefore, 

j = lim/(g fc ) (g k is a minimizing sequence) 

> f(g) (weak lower semicontinuity) 

> 3 (j = inf / and g e U ad ). 
This shows F(Q u (Eg),g) = f(g) = j, hence g is a global minimizer. 



3 Time discretization 

In this section we study a discretization in time of the optimal control prob- 
lem (P). The time-discrete version of the forward system (VI) is introduced in 
Section 3.1. The strong convergence of the time-discrete solutions (S T , u T ) is shown 
in Section 3.2. In particular, we prove a new rate of convergence of (S T ,it T ) in 
L°°(0,T; S 2 x V) without assuming additional regularity of S, see Lemma 3.2. 
Moreover, we show the convergence of (S r ,u T ) in iJ 1 (0,T;5 2 x V) (by using an 
idea of Krejci [1998]) and the convergence of A r in L 2 (0, T; L 2 (fl)), see Theorems 3.3 
and 3.4, respectively. These strong convergence results are new for the analysis of 
the quasistatic problem (VI) and both are essential for passing to the limit in the 
optimality system in Section 3. 

In Section 3.3 we prove the weak convergence of (X T ,it r ) in iJ 1 (0,T;S' 2 x V) 
assuming the weak convergence of the controls g T in iJ 1 (0,T;t/). This result is 
necessary in order to show in Section 3.4 that every local minimizer of (P) can 
be approximated by local minimizers of (slightly modified) time-discrete problems 
(Pp, see Theorem 3.10. This approximability of local minimizers is essential for 



If) 



GERD WACHSMUTH, September 6, 2012 



the derivation of necessary optimality conditions which are satisfied for every local 
minimizer of (P) in [Wachsmuth, 2012b, Section 3]. 

Notation. Throughout the paper we partition the time horizon [0, T] into iV in- 
tervals, each of constant length r = T/N for simplicity. We use a superscript r, i.e. 
(•) T , to indicate variables and operators associated with the discretization in time. 

For a time-discrete variable f T G X , where X is some Banach space, the 
components of f T are denoted by f[ , i — 1, . . . , N. When necessary, we may refer 
to /o with a pre-defined value (interpreted as an initial condition), mostly /q = 0. 

If f T G X N is the time-discretization of a variable / G H^(0, T; X), we identify 
f T with its linear interpolation, i.e., for t G \{i — 1) T,ir] we define 

r(*) = ^^/r+— /r-i. (3-D 

T T 

with /<[ = 0. Therefore, X N is identified with a subspace of HL } (0, T; X). We 
make use of this identification for the variables S, u and g. For later reference, we 
remark 

/ T W = ;C/7-/f-i) and fT = r(t)-(t-iT)f-(t) (3.2) 

for almost all f G ((i — 1) r, i r) and £ = 1, . . . , N. 

On the other hand, if f T G X w is the discretization in time of a variable belonging 
to L 2 (0,T; X), we identify f T with a piecewise constant function in L 2 (0,T;X), 
i.e., for t G [(i — 1) r, i r) we define 

/ T (i) = /r- (3.3) 
This is used for the plastic multiplier A. 

3.1. Introduction of a discretization in time of the forward problem. In 

this section we introduce a discretization in time of the forward problem (VI). 
Replacing the time derivatives by backward differences, we obtain the discretized 
problem: given t T G {V') N , find (£ r ,« T ) G {S 2 x V) N such that £[ G JC and 

(A&T-IlT_J + B*(uT-uT_ 1 ),T-i:T)>0 forallTG/C, (3.4a) 

SET = l\ in V, (3.4b) 

holds for all i G {1, . . . ,iV}, where (S^mJ) = 0. We denote the solution operator 
which maps l T — > (£ t ,m t ) by CJ T . Let us mention that, for fixed i, (3.4) can be 
interpreted as the solution of a static plasticity problem. Several properties of this 
time discretization are proven in [Han and Reddy, 1999, Proof of Theorem 8.12, 
page 196]. We recall the results which are important for the following analysis. 

In order to show an analog to (1.19) for the time-discrete problem, we test (3.4a) 
with T — + (t, — t), where r G S is arbitrary. Using (Sq, Uq) = we obtain 

C _1 <r T - e(u T ) - M~ 1 x T = a.e. in (0, T) x fl. (3.5) 

In [Han and Reddy, 1999, Lemma 8.8 and (8.37)], we find the a-priori estimate 

II^' T ||h 1 (0,T;S 2 ) < C p T ||ifi(0,T;V')' (3-6) 
Together with the inf-sup condition (1.13) of B* and (3.5), we obtain 

l|S T ||_H- 1 (0,T;S 2 ) + ||w T ||.ff 1 (0 1 T;V) < C 11^ II H 1 (0,T;V) • (3-7) 

Due to this a-priori estimate, Han and Reddy [1999] are able to prove the weak 
convergence of (S t ,m t ) = Q T {l T ) towards the solution = Q{1) of (1.18) in 

il 1 (0, T; 5 12 x V) if the data £T in (3.4) is chosen as the point evaluation of I. 
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From an optimization point of view, this result is too weak for two reasons. First, 
the convergence is not w.r.t. the strong topology in iJ 1 (0,T;5' 2 x V). Second, the 
convergence is proven for a restricted choice of £ T . We therefore strengthen the 
results of Han and Reddy [1999] in Theorem 3.3. 

As for the continuous problem, the time-discrete problem can be formulated as 
a complementarity system. The results in [Herzog et al., 2011b, Section 2] prove 
the existence of A T € L 2 (fl) N such that the complementarity system 

AC£J-Y.U) + B*(uT-uU)+t\IV*VY,1 = mS 2 , (3.8a) 

BET = £T in V', (3.8b) 

< A[ _L 4>(SJ) < a.e. in Q (3.8c) 

is satisfied. Here, we scaled the multiplier \ T appropriately. 

Similarly to (3.5) we obtain a representation formula for A T , 

- C-Vl - *U) + e« - = -H- X (xl - XU) = t A[2?ST. (3.9) 

Remark 3.1. The same time discretization approach is used in Krejci [1996, 1998] 
to prove the existence of a solution of (EVI). Indeed, by applying the time dis- 
cretization [Krejci, 1998, (3.6)] to the reduced formulation (2.5), we obtain: find 
€ /Qt , such that 

(A(EJ - EJLi), T ST) > for all Te/Qr. 

As discussed for the continuous problem (1.15), we can introduce a Lagrange mul- 
tiplier associated with -BE[ = £J such that (£ T ,it T ) satisfy (3.4). 



In the following two subsections we prove that the discretization in time con- 
verges in a strong and a weak sense, similar to the continuity properties of the 
forward operator Q proven in Sections 2.2 and 2.3. Under the assumption that £ T 
converges strongly to I in the space iJ 1 (0,T;y') or that g T converges weakly in 
the space i? 1 (0, T; U), the corresponding solutions (S T , u T ) are shown to converge 
strongly (or weakly) to (X, u) in -ff 1 (0, T; S 2 x V), respectively. 

Both statements will be needed in Section 3.4 to show the approximability of 
local solutions of (P) mentioned in the introduction of this section. 

3.2. Strong convergence. The strong convergence of the discretization in time of 
an EVI can be found in [Krejci, 1996, Proposition 1.3.11]. He proves the convergence 
in VT /1,1 (0,T;X) of the time-discrete solutions. The data of the time-discretized 
problems are taken as the point evaluations of the data of the continuous prob- 
lem. We use these ideas to prove the convergence of (S T ,M r ) in 7J 1 (0,T;5' 2 x V) 
whenever the right-hand sides l T converge in _ff 1 (0,T; V') as r \ 0. Moreover, we 
show the convergence of A T in L 2 (0, T; L 2 (fl)) as r \ 0. This is a new result for 
the analysis of (VI) and a crucial ingredient for proving the optimality system in 
[Wachsmuth, 2012b, Section 3[. 

Let us consider a sequence of time steps {r^} and a sequence of loads £ Tk G 
(V') Nk , with N k r k = T, such that the linear interpolants £ Tk e H^ o} (0, T; V) con- 
verge in H^O^iV) towards some I e H^ o} (0,T;V). We denote by (S Tfc ,u Tfc ) e 

(S 2 x V)^ the solutions of the time-discrete problem (3.4a), as well as their piece- 
wise linear interpolants. The aim of this section is to prove (S rfc ,u Tk ) —> (S,tt) in 
ff 1 (0, T; S 2 x V), where (£, u) is the solution to the continuous problem (1.18). 

For simplicity of notation, we omit the index k and we refer to u £ Tk — ¥ £ if 
k — > oo" simply as il £ T — > £ as r \ 0". 
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The proof of Theorem 3.3 relies on the convergence argument Lemma A.l which 
is tailored to the analysis of EVIs. Its prerequisite (A. la) is the convergence of E T 
in L°°(0, T; S 2 ). This is addressed in a preliminary step and required in the proof 
of Theorem 3.3. 

Lemma 3.2. Let l r be a bounded sequence in H^(0,T; V') such that i T — > £ in 
W 1 ' 1 (0,T;V) asr\0. Then 

||S - S T ||i oo(0)T;S2) < C - tf LH0 , T . v>) + r). (3.10) 
In particular, E T -> E in L°°(0, T; S" 2 ) as r \ 0. 

Proof. We have 

~||E(t) - S^(t)||i = (At(t) At T (t), E(t) - E T (i)) f.a.a. « e (0,T). (3.11) 

Let us fix some t G ((i — 1) r, ir), z £ {1, . . . , A}. Since E T (i) is a convex combi- 
nation of ET and see (3.1), and ST_ l7 EJ € /C, we obtain E r (£) € /C. Hence, 
T = E r (i) + Hi-i-r(t) e /C. Using ST = (1.18a) implies 

(i4E(<), £ T (i) - S(t) + > 0. (3.12) 

Testing (3.4a) with T = E(i) + E^-^(t) yields 

(A(£[ - STLJ, S(<) - ST + S^ (t) ) > 0. 

Using (3.2) we obtain 

<AS T (t), E(t) - S T (i) - (it - t) S r + 5V (tM(t)+(iT _ t)iT(t) ) > 0. 

Together with (3.12) this yields 

(At(t) - At T (t), E(t) - £ r (i)) < (At(t) - At T (t), E/_*r(t)) 

+ (it -t){A± T {t), -± T (t) + -£ t (t)). 

Integrating over t and using (3.11), together with the boundedness of S and S^ T 
in L 2 (0, T; S 2 ), see (3.6), yields 

~||S(t) - E r (t)|fc < ^ (AE(s) - At T (s), H t -tr{a)) ds + Cr. 
Integrating by parts and using S(0) = S T (0) = gives 
~||£(t) - E T (t)|ft < (AS(t) - ATT(t), Vt-trQk)) 

-f (AH(s)- AS r (s), £^(s)) ds + Cr 
Jo 

<C(||E(t)-E T (i)|| s2 p(t)-r(t)|| v , 

+ II E - E T ||l=o(o,t ; s 2 ) IK - ^"IU^o.tjV) + T )- 
Taking the supremum over t £ (0, T) on both sides shows 

II s - E r || 2 :oo(0jT;S 2 ) < C (||S - S r || L oc.( 0iT;S 2) p - ^ t ||l~(o,T;V) 

+ l|E — E r || L oo( ,T;5 2 ) |K~ ^ T ||l!(0,T;\/') + t). 

Finally, Young's inequality and W X ^%T\ V) c -> £°°(Q,T; V) yields 

lis - shi!oo (0>t . S2) < c (||i - rni 1(QtT;V/) + r). 



Let us comment on the case when l T is the point evaluation of £. The estimate 
(3.10) shows the order of convergence of t 1 / 2 w.r.t. the L°°-norm provided that £ € 
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T4 Al,1 (0,T; V). This rate of convergence is a new result for the time-discretization 
of (VI). 

In [Han and Reddy, 1999, Theorem 13.1] the estimate 

l|S - S T || i = O ( 0iT; 52 ) < CT ||£|| W 2,1( ,X;S 2 ) 

is shown under the assumption X G M /2,1 (0, T; S 2 ). However, this assumption is 
unlikely to hold for the solution of an EVI. There are many examples where the 
solution of an EVI only belongs to W /1,oo (0, T; X), even if X is one-dimensional 
and the input is smooth, see e.g. [Krejci and Lovicar, 1990, Examples 1, 2]. Hence, 
X G W 2,1 (0, T; S 2 ) cannot be considered the generic case. 



Theorem 3.3. Let us assume i T -> t in Hh } (0, T; V) as t \ 0. Then (S r , u T ) -> 
(E, w) in iJ^j (0, T; S* 2 x V) as r \ 0. 



Proof. We follow the idea of [Krejci, 1998, Proof of Theorem 3.6]. We will apply 
Lemma A.l with the setting 

X — S 2 A , u„ = S ( t-2S t , u = E.«-2E, 
p = 2, g n = ||S iT (-)IU, ffo = H^COIU- 

to show the convergence of E T . Here, 5^ denotes the Hilbert space S 2 equipped 
with the inner product induced by A. To apply Lemma A.l, we have to verify the 
prerequisites (A.lc) and (A. Id). 

By definition of the discrete problem (3.4), we have for a fixed i G {1, 2, . . . , N} 

(A(I% - EJLj), T - ST) > for all T £ fC, BT = ij, 

which is the time-discrete analog to (1.15). We choose T = SJ j + S|t-^t_ , see 
(2.4) for the definition of S^. By definition of S^ we have BT = £T and the shift 
invariance (1.7) implies T G K,. This shows 

A(S[ - ST^), S[_ 1 + S, r _e t - ST) > 0. 

Therefore dividing by r 2 and using the notation of linear interpolants, see (3.1), 
implies 

(A± T (t), T, h {t) - *f(t)^ > f.a.a. t G (0, T). 

This shows 

||E iT (t)-2E T (t)|U< ||S ir (t)|U f.a.a. te (0,T), (3.14) 

where ||-||^ is the norm on S 2 induced by A, see (2.2). 

To derive a similar formula for the continuous solution, we test (1.15) with 
T = S(f + h) + ^i(t)-t(t+h) f° r h > and obtain 

A±(t), T, m -i(t+h) + E(* + ft) - > 0. 

Passing to the limit h \ 0, using [Krejci, 1998, Theorem 8.14], yields 

AS(i), S^i) - E(t)^ < f.a.a. t G (0,T). 
Analogously, we obtain by h /• 

/AS(t), E^t) - E(t)^ > f.a.a. i G (0, T). 

Hence 

||5^(i)-2E(i)|U = ||^(t)|U f.a.a. te (0,T). (3.15) 
In order to apply Lemma A.l with the setting (3.13) we check the prerequisites. 
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(A.la): Lemma 3.2 ensures E T — s- S in L°°(0, T;S^) and the assumption i T -± I 

in ff^OjT; V') implies E £ . -> E £ in L°°(0, T; 5 2 ). 
(A. lb): By the linearity of I H- E|, the assumption f ->• £ in ^(Q.T; V) implies 

||E^(.)m->||Si(OIUmL a (0,T;H). 
(A.lc): This was shown in (3.14). 
(A. Id): This was shown in (3.15). 

Therefore, Lemma A.l yields 

||5V - 2 E T - (Ej - 2 E)||iji( , T;S 2 j -> 0. 

By the assumption i T — > £ in i/ 1 (0, T; V') we infer 

l|S r - E||Hi(o,TiSj) ^° asr\0. 

Using (1.19), (3.5) and the inf-sup condition of B, we obtain u T — >• it in 
if^O,! 1 ; V) as r \ 0. 



Using the representation formula (3.9) for A T we are able to prove the strong 
convergence of A r in L 2 (0, T; L 2 (f2)) towards A. We remark that this is a novel 
result for the analysis of (1.22). 

Let us mention that this strong convergence is crucial for the analysis in Wachsmuth 
[2012b]. Without this strong convergence, we could neither pass to the limit in the 
complementarity relation /i r A r = nor in the adjoint equation, see [Wachsmuth, 
2012b, Section 3[. Hence, this convergence is essential to prove the necessity of the 
system of weakly stationary type in Wachsmuth [2012b] . 



Theorem 3.4. Let £ T -> 

L 2 (0,T;L 2 (fi)) as r \ 0. 


1 in HL } (0,T;V') as r \ 0. Then A T - 


A in 





Proof. Step (1): Weshowthat A r A in £ 1 (0, T; L x (fi)). Using A[ |2?E[| 2 = d\ A[ 
by (3.8c) and (3.9), we obtain 

A[ = 4 K T>Y% : PET = J-jC-BT^ _ ^ ) . p S T). 
^o r(J o 
Using the notion of piecewise linear interpolations x r , E T and the piecewise con- 
stant interpolation A 7 ", we obtain for t € {{i — 1)t, it), see (3.2), 

A T (t) - ^ (H -1 * T W : (t) -{t-ir) Vt r (t)] ) . 

Using A = — H _1 x : PE/ct^, by (1.22), this implies 

a 2 |A T (t) - X(t)\ < |H- x x T (i) : ra T (t) - H- 1 *^) : 2?S(t) 

+ (t-ir) E.- 1 x T (t):Vt T (i) 

< 



- 1 X r (t):T>Yr(t)-W- l x(t):VV(t) 
- x X T (i):X»S T (t) 



Hence, 



|A r - A|| L1(0 , T;il( a)) < c (||x T || ||^£ r - 2?E|| + ||2?E|| \\± T - x\\ 

+ t\\x t \\\\v£ T 



OPTIMAL CONTROL OF QUASISTATIC PLASTICITY, September 6, 2012 



21 



where all norms on the right-hand side are those of L 2 (0, T; S). Using S T — > X in 
iJ^TjS 2 ) implies A T — > A in Z, 1 (0,T;L 1 (fl)). 

Step (2): We show that || A r \\l^{o,t-l^{q,)) -> II A||L 2 (o : T;L 2 (n))- Using that A r is 
the piecewise constant interpolation of (XJ); 



\X T \\ 2 



)i=i> we obtain 

N 



i=l 



L 2 (0,T;L 2 (fi)) 

Using A[ = ct A[ by (3.8c), we obtain 

XJVEJ\ 2 dx 



(xrydx. 



l^ T ||l 2 (0,T;L 2 (O)) 



T ^ 



-y A/ 



(by (3.9)) 



X \\l 2 (0,T;S)- 



An analogous calculation shows || All j^a 



2 (o,T;L 2 (n)) = J^IIH 1 A v IU 2 (o,T;S)- Hence, by 
S T — > S in i/ 1 (0,T;S' 2 ), see Theorem 3.3, we infer the convergence of norms 

ll^ T ||L 2 (0,T;L 2 (n)) l|A||i 2 (o,T;L 2 (n))- 

Step (3): We show that A T -> A in L 2 (0,T; L 2 (n)). By Step (2) we know 
that for every subsequence of r, there is a subsequence such that X Tk X in 
£ 2 (0, T; L 2 (fl)) for some A. Step (1) implies A = A. Hence, the whole sequence 
converges weakly. In view of the convergence of norms, A r converges strongly to A 
in L 2 (0,T;L 2 (f2)) as r \ 0. 



3.3. Weak convergence. In this section we show the weak convergence of the 
time-discrete states (£ r ,« r ) = Q T (Eg T ) in M,,(0,T;S 2 x V), under the assump- 
tion that the controls g T converges weakly towards g in H} Q -,(0,T; U). This result 
is essential for proving the approximability results in Section 3.4. Similarly to the 
weak continuity of the forward operator proven in Section 2.3, we need the weak 
convergence of the right-hand sides Eg T with respect to a stronger norm in space, 
see the discussion in the beginning of Section 2.3. 

Theorem 3.5. Let g T <G U n such that the linear interpolants converge weakly, i.e., 
there is g G HL } (0, T; U) such that g T g in HL } (0, T; U) as r \ 0. 

Then the solutions (S r ,M r ) of the time-discrete problem (3.4) with right-hand 
side t T = Eg T converge weakly in H^(0,T; S 2 x V) towards the solution (S,it) 
of (1.18) with right-hand side I = Eg as r \ 0. 



Proof. Due to the a-priori bound (3.7), there exists a weakly convergent subsequence 
of (E T , u T ), denoted by the same symbol, i.e. (S r , u T ) (£,&) in H 1 ^, T; S 2 x V) 
as t \ 0. As in the proof of Theorem 2.7, we shall show (£,«) = (E,it). Hence 
the whole sequence converges weakly and this simplification of notation is justified. 

Let T G L 2 (0, T; S 2 ) with T £ JC a.e. in (0, T) be arbitrary. Let i G {0, . . . , iV-1} 
and k G (0, 1) be given. Set t = (i + k)t. Testing (3.4a) with T(t) yields 

(At T (t) + B*u T (t), T{t) - S r (^ + 1) t)) > 0. 
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Using S t ((i+1)t) = S T (() + (l-/t)TS T (t) by (3.1), and integrating over k £ (0,1), 
i.e. t £ (i t, (i + 1) t) yields 
i.(i+x)r 

< / (AS (t) + B*u T {t), T(t) - S r (( Z + 1) t)) dt 

= / (AS +S*m t , r- S T )dt- - / (AS +BV, £ )dt. 

iir 2 r 

Hence, summing over i = 0, . , . , iV — 1 implies 

rT rT 

0< / (AS T + B*u T 1 T- S T )dt - - / (AS r + 5*ti T , if)dt. (3.16) 
Jo 2 J 

Due to the boundedness of (S T , u T ) in i? 1 (0, T; S 2 xV), the second addend goes to 
zero as r \ 0. For the first addend we can proceed as in the proof of Theorem 2.7: 
the application of Lemma 2.6 yields 

limsup / (AS T + B*u T , T - S r ) dt < / (AS + B*u, T — S) dt. 
t\o Jo Jo 

Together with (3.16) this implies 

0</ (AS + B*u, T - S) dt for all T G L 2 (0, T; S 2 ), T e /C a.e. in (0, T). 
Jo 

Therefore, (S,u) is the solution of (1.18). This implies (S,m) = (S,tt). 



3.4. The time-discrete optimal control problem. In this section we consider 
a time discretization of (P). The time-discrete controls belong to the space U , 
which is identified with a subspace of iJj O |(0, T; U) via the piecewise linear inter- 
polation (3.1). The discretization of the admissible set is given by U[ d = J7 a d H E/ , 
where r = T/N. Now, the time-discrete optimal control problem reads 



Minimize F(u T ,g T ) = ^(tt r ) + - ||g T ||ffi (0iT . r/) 

such that (S T , tt r ) = G T {Eg T ) 
and g r e LT" d . 



(P r ) 



For the analysis of (P T ) we need the additional assumption that all g £ U a a can be 
approximated by time-discrete controls g T £ t/J d . 



Assumption 3.6. In addition to Assumption 2.8 we suppose that for all g £ C/ a d, 
there exists g T £ E7J d , such that g T — >• g in i? 1 (0, T; [/) as r \ 0. 



We remark that this condition is satisfied for both examples of C/ a d given after 
Assumption 2.8. 

The aim of this section is to answer the question whether optimal controls of (P) 
can be approximated by optimal controls of (P r ). Similar results for a regularization 
of an optimal control problem of static plasticity have been proved in [Herzog et al. , 
to appear, Section 3.2]. These results are very common for the approximation of 
minimizers of optimal control problems and the technique of proof is applicable to 
various problems, see also Casas and Troltzsch [2002], Barbu [1981]. To be precise, 
we prove that 



• one global optimum of (P) can be approximated by global solutions of (P r ) , 
see Lemma 3.8, 
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• every strict local optimum of (P) can be approximated by local solutions 
of (P r ), see Lemma 3.9, 

• every local optimum of (P) can be approximated by local solutions of a 
perturbed problem (Pg), see Theorem 3.10. 

These results are very important for the analysis in Wachsmuth [2012b]. Without 
these results at hand, one cannot show the necessity of the optimality conditions 
therein. 

Although both of u T and g T are optimization variables in (P r ), for simplicity 
we refer solely to g T being (locally or globally) optimal. This is justified in view of 
the continuous solution map Q T of (3.4). 

First we check that there exists a minimizer of the time-discrete optimal control 
problem (P T ). 

Lemma 3.7. There exists a global minimizer of (P T ). 



Proof. In [Wachsmuth, 2012a, Section 2.1] we show in a more general framework 
that the solution operator Q T of (3.4) is Lipschitz continuous from (V) to (S 2 x 
V) . Since the control operator E from U N into (V) is compact, Q T is compact 
from U N to (S 2 x V) . Now we proceed similarly as in the proof of Theorem 2.9. 



Now, we are going to prove the approximation properties. 

Lemma 3.8. Suppose that Assumption 3.6 is fulfilled. Let {r} be a sequence 
tending to and let g T denote a global solution to (P T ). 

(1) Then there exists an accumulation point g of {g T } in i/ 1 (0,T; U) and 

(2) every weak accumulation point of {g T } in 7J 1 (0,T;C7) is in fact a strong 
accumulation point in ff 1 (0, T; U) and a global solution of (P). 



Proof. By Assumption 2.8, we can choose g <E /7 a d- By Assumption 3.6, there 
exists a sequence g$ £ U^ d which converges towards g in H} ~,(0,T; U). Hence, 
by (3.7) the corresponding displacements Uq converge towards Uq in i/| o j(0, T; V). 
Together with the continuity of ip, this implies the convergence of i^(wo>ffo)- Since 
g T is a global optimum of (P r ), we have F(u T ,g T ) < F(iIq~>5o)- Hence, {g T } is 
bounded in iJ 1 (0,T; U). This implies the existence of a weakly convergent subse- 
quence in this space. Therefore, assertion (1) follows by assertion (2). 

To prove assertion (2), let {g T } converge weakly towards g in i/ 1 (0,T;t/) as 
r \ 0. We denote by (£ r ,it T ) = Q T (Eg T ) the (time-discrete) solution to (3.4) and 
by (X, u) = Q(Eg) the solution to (1.18). Due to the weak convergence proven in 
Theorem 3.5, we have u T — u in H l {Q,T; V). 

Let g £ C/ a d with corresponding displacement it be arbitrary. By 
Assumption 3.6, there is g T G L7^j, such that g T -> g in H 1 (0,T;L 2 (T N ;W 1 )). We 
denote the corresponding displacements by u T . By Theorem 3.3 we infer u T — > u. 
We have 

F(u,g) < liminf F(u T ,g T ) by weak lower semicontinuity of F 

< liminf F(u T ,g T ) by global optimality of (u T ,g T ) 

= F(u,g). by convergence of (u T ,g T ) 

This shows that g is a global optimal solution. Inserting g = g yields the conver- 
gence of norms and hence the strong convergence of g T in 7J 1 (0,T; U). 
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Lemma 3.9. Suppose Assumption 3.6 is fulfilled. Let g be a strict local optimum 
of (P) w.r.t. the topology of 77 1 (0,T; U). Then, for every sequence {r} tending to 
0, there is a sequence {g T } of local solutions to (P T ), such that g T — >• g strongly in 
fr 1 (0,T;Ef) as r \ 0. 



Proof. Step (1): Let e > 0, such that g is the unique global optimum in the closed 
ball B e (g) with radius e centered at g. We define U a a — {J a( j n B e (g). 

Step (2): We check that Assumption 3.6 is fulfilled for U a d- Clearly we have 
g G U a d- Let g e [/ a( j be arbitrary. We construct a sequence {g T } C J7 a d, (f G £7^, 
such that g T — > g. Since t/ a( j satisfies Assumption 3.6, there are sequences of time- 
discrete functions (g T ), (g T ) C U a d, converging towards g and g, respectively. 

If \\g — g\\H 1 (o.T-,u) < £ i the convergence if — > g implies g T S B e (g), and hence 
g r e [7 a d n [/ ad for sufficiently small r. 

Otherwise, if ||g - gWn^o^u) = £, w e obtain lim||g - g r \\B l (o,T ; u) = £■ There- 
fore, there exists To, such that for all r < tq, we have \\g — g T \\H 1 {o.T-u) < e /2 and 
Wq ~ (f \\h 1 (q,T:U) > £ /2. Now, we construct a convex combination if of g T and g T 
which belongs to B e (g) and approximates g. We define the sequence if by 

9 =(1-V)9 +V 9 , with if = max 0, T ztt, 77: G [0,1]. 



< 



9 - 9 t \\hi(o,T;U) - e/2 
In case ||g — g T ||jji(o, t ; c/) > £, we find (all norms are those of ff 1 (0,T; £7)) 

\\g-g T \\ <(i-f)\\g-g T \\+f\\g-g T \\ 

= \\g-g\-s/2 {^-a T \\ + (\\9-9 T \\-e)\\ 9 
\g-g\-zn (fllg-9 r H + (llg-g r ll- £ )f 

= E. 

This shows ||g — g r ||_H-i(o,T:t/) < £ - Moreover, if S [0, 1] implies g T S f/ a( j. There- 
fore, if £ C/ a d. Further, lim||g — if ||hi(o,t ; c/) = £ implies 77 1 " — > and hence 
g T — >- g. This shows that t7 a d satisfies Assumption 3.6. 
Step (3): We define the auxiliary problem 

Minimize F(u,g) = ip{u) + - \\g\\ 2 H i 
such that {11,11) = G {Eg) 
and g e E/ ad n B E (g). 



(P«,e) 



Since g is the unique minimum in C/ a( j = £7 a d fl B E (j), it is the unique global 
minimizer of (P g , e ). Invoking Lemma 3.8 yields the existence of a sequence g T of 
global solutions of the associated time-discrete problem (PJ), such that g r — >• g in 

//';•'./:/). 

5*iep (^J: It remains to check that g r are local solutions of (P T ). The convergence 
g T — >• g in 7J 1 (0,T; {/) implies that there is a To such that ||g — g T ||,ffi(o,T:;y) < e /2 
holds for all r < To. Let t < tq and if G B E / 2 (g T ) H (7J d be arbitrary. By the 
triangle inequality we infer g T G B e (g). This implies g T G f/ a d n J7J d . The global 
optimality of g T implies F(£ T - u (g T ), g T ) < F(£ r '"(g T ),<7 T ). Hence, g T is a local 
optimum of (P T ) in the neighborhood B e / 2 {g T ). 
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Finally, we address the approximability of a local minimum, which is not assumed 
to be strict. Let g be a local optimum of (P) w.r.t. the topology of iJ 1 (0,T; U). 
We define the modified problem, see also Casas and Troltzsch [2002], Barbu [1981], 

Minimize F g {u,g) = i]>(u) + ~\\gf H i ( 0>T;U) + ~||<7 - 9\\m( ,T;U) ) 

such that (E,u) = g(Eg) \ ( P s) 

and g £ £/ ad J 

Clearly, g becomes a strict local optimum of (P g ). Analogously to (P r ) we define 
the time-discrete approximation (Pg). 

Theorem 3.10. Suppose Assumption 3.6 is fulfilled. Let g be a local optimum of 
(P) w.r.t. the topology of H 1 (Q,T;U). Then, for every sequence {r} tending to 0, 
there is a sequence (g T ) of local solutions to (Pg), such that g T — >• g strongly in 
H X (Q,T\U) asr\0. 

Proof. Since the additional term \\g — gW^i^ x-u) * s wea kly lower semicontinuous, 
this follows analogously to Lemma 3.9. 

Due to this theorem, we are able to derive necessary optimality conditions for 
(P) by passing to the limit with the optimality conditions of (Pg), see [Wachsmuth, 
2012b, Section 3]. 

A A convergence result in Bochner-Sobolev spaces 

For the proof of Theorem 3.3 we need the following result. It is a consequence of 
[Krejci, 1998, Theorems 8.5, 8.7] which are cited as Theorems A. 4 and A. 5 below. 

Lemma A.l. Let X be a Hilbert space and p £ [1, oo) be given. Let {u n } C 
W 1,p (0, T- X), {g n } C LP(0, T; R) be given sequences for n € N U {0} such that 



u n -> u in L°°(0, T; X) (A.la) 

g n ^g inLP(0,T;M), (A.lb) 

||wn(t)||x < 9n{t) a.e. in (0, T), for all n € N, (A.lc) 

\\u (t)\\x =9o{t) a.e. in (0,T). (A.ld) 
Then u n -> u a in W 1,p (0,T;X). 



In order to prove Lemma A.l, we show two auxiliary results. 

Lemma A. 2. Let A be a normed linear space. The sequence {x n } C X is assumed 
to be bounded in X and let V C X 1 be some subset of the dual space X' . If for 
some i6l 

f(x n ) — > f{x) as n — > oo (A. 2) 

holds for all f £ V, then (A. 2) holds for all / £ c\V, where the closure is taken 
with respect to the X'-norm. 



Proof. Let / £ clV be given. Then there exists {f n } C V with /„ — > f. Due to the 
boundedness of {x n }, there is a C > with ||jc n ||x < C and ||a;||jc < C- Let e > 
be given. Then there is a m € N such that ||/ m — /||x' < e/(3C). Since (A. 2) 
holds for / m , there exists neN such that \f m (x n ) — fm{ x )\ < e /3- This implies 

|/(^n) ~ f(x)\ < \f(x n ) - f m {x n )\ + \f m (x n ) - f m (x)\ + \f m (x) - f(x)\ 

< 11/ - fm\\x> \\x n \\x + \frn(x n ) ~ fm(x)\ + \\f~ frn\\x> \\x\\x < £■ 
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This proves that (A. 2) holds for /. 



Lemma A. 3. Let X be some Hilbert space and let {u n } C T / F 1,1 (0,T;X) and 
u G P^ li:L (0,T;X) be given. If u n -> U in L°°(0,T;X) and if {u„} is bounded in 
W 1,1 (0,T;X), then 

(u n (t),<p(t))dt-> [ (u(t), <p(t))dt for all p G C(0,T;X), (A.3) 
o ./o 

i.e. the sequence {u n } converges in (7(0, T; A)' with respect to the weak-*-topology. 



o 



Proof. Let us prove that (A.3) holds for all ip G C^Q.TjX). Due to W 1,1 (0, T; X) C 
(7(0, T;X), integrating by parts (see [Krejci, 1998, (8.24)]) and the convergence of 
u n in L°°(0,T;X) yields 

(« n (t), p(t)) dt = - I (u n (t), <p(t)) dt + (« n (T), p(T)) - MO), p(D)) 

T 

(u(t), <p(t)) dt + (u(T), ip(T)) - ( U (0), <^(0)> 

o 

T 

(u(t), ¥>(*)) dt 



for all 93 G C^TjX). Using clC^TjX) = (7(0, T;X) with respect to the 
IMlL oo (o,T;X)- nornl j Lemma A. 2 yields the claim. 



We remark that Lemma A.3 follows directly from [Krejci, 1998, Theorem 8.16], 
with the settings 

u n := u := (p, Cn := v n , £ := v, 

since Var[ 0j T] v n < c holds due to the boundedness of v n in W ' (0, T; X). The 
direct proof is presented for convenience of the reader. 



Theorem A. 4 ([Krejci, 1998, Theorem 8.7]). Let A be a Hilbert space and {v n } C 
2^(0, T; X), {g n } C L l {Q, T; R) be given sequences for n G N U {0} such that 

lim / (v n (t) - v (t), (p(t)) dt = for all ip G (7(0, T; X), (A.4a) 

n— >-oo Jq 

<?„-> 5 o in£ 1 (0,T;R), (A.4b) 

IK(*)IU ^ &»(*) a.e. in (0,T), for all n G N, (A.4c) 

\\v (t)\\x =g (t) a.e. in (0,T). (A.4d) 
Then u„ -> w in L^O, T; X). 



Theorem A. 5 ([Krejci, 1998, Theorem 8.5]). Let B be a Banach space and p G 
[l,oo) be given. Let {w„} C L p (0,T; B), {g n } C L p (0,T;R) be given sequences for 
n G N U {0} such that 

u n (*) -> "o(*) a.e. in (0,T), (A.5a) 

5„-> fl o inLP(0,T;R), (A.5b) 

IK(*)||s < 9n(t) a.e. in (0,T), for all n G NU {0}, (A.5c) 

Then v n -> w in 1^(0, T; B). 



OPTIMAL CONTROL OF QUASISTATIC PLASTICITY, September 6, 2012 



27 



Corollary A. 6. Let A be a Hilbert space and p £ [l,oo) be given. Let {v n } C 
L p (0, T; A), {g n } C L p (0, T; R) be given sequences for n £ N U {0} such that 

lim / (vjt) - v (t), <p(t)) dt = for all yp £ C(0, T\ X), (A.6a) 

n—too 

3„-> 5o inLf(0,T;R), (A.6b) 

IKWHa < a.e. in (0,T), for all neN, (A.6c) 

IKWHjt =90(*) a.e. in (0,T). (A.6d) 
Then v n -> w in L p (0, T; A). 



Proof. Applying Theorem A. 4 yields w„ — > v in L 1 (0,T; A). Thus there exists a 
pointwise convergent subsequence {v nk }. Hence, Theorem A. 5 yields v nk — > v in 
L p (0,T;A). Since we can start with an arbitrary subsequence of v n , we obtain 
V n -+v in LP(0,T;A). 



Proof of Lemma A.l. By (A. lb) and (A.lc) we obtain the boundedness of u n in 
T4 Al,1 (0,T; A). Lemma A. 3 yields (A. 3) and hence we can apply Corollary A. 6 for 
v n = u n - This yields u n — > u in L p (0,T; A) and hence u n — > u in W 1,p (0,T; A). 
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